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Abstract. For a sample of absolutely bounded i.i.d. random variables with a con- 
tinuous density the cumulative distribution function of the sample variance is repre- 
sented by a univariate integral over a Fourier-series. If the density is a polynomial 
or a trigonometrical polynomial the coefficients of this series are simple finite terms 
containing only the error function, the exponential function and powers. In more 
general cases — e.g. for all beta densities — the coefficients are given by some series 
expansions. The method is generalized to positive semi-definite quadratic forms of 
bounded independent, but not necessarily identically distributed random variables if 
the form matrix differs from a diagonal matrix D > only by a matrix of rank 1. 
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1. Introduction 

There is a lot of literature on the approximation of distributions of sample statistics 
for small samples. With the increasing technical availability of transcendental functions 
it becomes easier to compute exact representations for many distributions. For the 
investigation of the accuracy of any proposed approximations to a distribution it would 
also be helpful to have an exact representation and not to rely only on Monte Carlo 
methods. 

In particular, for the exact distribution of the sample variance s 2 from non-normal 
random variables only a few scattered results seem to exist. E.g. Mudholkar and Trivedi 
(1981) recommend transformations of the Wilson-Hilferty type to approximate the dis- 
tribution of s 2 . Former work is cited in this paper. In Royen (2007a), (2007b) several 
different representations are found for the cumulative distribution function (cdf) of s 
from a gamma parent distribution. The first paper also contains an orthogonal series 
for the cdf of Greenwood's statistic (the square sum of "spacings") as a by-product. In 
the second paper a representation of the cdf of s 2 from a uniform distribution is given, 
which is a univariate integral over a Fourier series. The method for this special case is 
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generalized in the underlying paper. 
Let be 

n 

Q = Q n = (n-l)s 2 = ]T(X,-X) 2 , (1) 

i=l 

where Xi,...,X n are bounded i.i.d. random variables with a continuous density / 
satisfying some mild regularity conditions. Without loss of generality the interval (0, 1) 
(or (—1,1)) can be chosen as the support of /. By geometrical considerations on the 
n-unit cube it can be shown that 



n/4 , n even 
(n 2 - l)/(4n) , n odd 



g:=SU P Q„ = { f 2 i \ n a~\ _ ( 2 ) 



An estimation for the order of magnitude of the cdf _Fq (x) of Q for small 2 in the 
following section suggests for the density /q of Q 

lim = lim / (a;) = 0, (3) 

already for small values n if f(x) does not increase "too rapidly" for x tending to or 
to 1. In this case a Fourier sine expansion for Jq would be suitable and the formulas in 
the theorems in section 2 and 3 aim at this case. Nevertheless, the condition (3) is not 
necessary for the validity of the representations in these theorems. Besides, they can be 
modified easily to apply to general Fourier series. 

Theorem 1 in Royen (2007b) provides a general formula for the Laplace transform 
(LT) of a quadratic form X' AX of X' = (X\, . . . ,X n ) with an " m-factorial" matrix 
A = D^fCC, where D > is diagonal and C is an n x m-matrix of rank m. Q is a 
one-factorial quadratic form with the identity matrix D and C' = n _1 / 2 (l, . . . , 1). Since 
Q is bounded its LT is holomorphic and its characteristic function (cf) /q is obtained 
from the LT by the substitution t — > —it. With 

ip(t, z) := £ cxp (-(z + ^J~Tt Xf^j , fei, zeC, (4) 

it follows from the above formula that 

f Q (t) = y/^ ^ m,y)) n dy. (5) 

J — OO 

This identity is also directly obtained when expfw 2 ) is replaced by the integral 

/oo 
exp(— y 2 — 2wy)dy = exp(w 2 ) 
-oo 

with w = Ht/n) 1/2 Ej=i^' in thc cf of Q = £™=i X] - (E"=i^)7™ aft er the 
substitution y — > y^/n. 

The Fourier sine coefficients of /q are given by 

,„ , .„ , , 1 Akn/n , n even I . . 

2/g)9fm(/ t fc , t k = kn/q = I ' ' . (6) 

I 4kmr/(n z - 1) , n odd J 
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However, the formula (5) is numerically useless since 

max exp (-(y + x^ftjl - ix\JT/2 ) 2 ^j = cxp(te 2 /2) 

increases very rapidly for large t. Generally, integration over exp(— z 2 ) should be avoided 
in the "critical area" defined by |3?e(z)| < |3m(z)|. This is accomplished in (5) e.g. by 
changing the path of integration with t = t k to 

7fc ;= {y - V^itii | y < o} u {uV^ih | - 1 < « < 0} u { y \ y > 0}, (7) 

where j k has been identified with the set of its points for simplicity. For a density / on 
(— 1, 1) 7fe is defined by 

{y + sgn(y) % /^ | y e K\{0}} U {uV^ \ - 1 < u < 1} 
It should be noted that 

ip*(t,u) := ipfau-^ii) = £ cxp(it{X + u) 2 ) (8) 
is the cf of (X + u) 2 . 

If the density /q is square integrable (/q e £ 2 ) then an absolutely and uniformly 
convergent series for the cdf of Q follows from (5) and (6) with the changed paths -fk by 
integration over x: 

oo 

Fq(x) - P{Q <x} = {2/7r)J2%™(fQ(tk)){l-cos{t k x))/k, < x < q, (9) 

fe=i 

where the terms free of x sum up to 1 — £(Q)/q, and 

f Q {t k ) = ^/^ ( m klZ )) n dz. (io) 

The more elegant formula 



/g(ffe) = Vn/nV^h (r(tk,u)) n du (11) 

will also be proved in section 2, but formula (10) is numerically more attractive because 
of the rapid decrease of exp(— y 2 ), y G R. 

Finally, a univariate integral representation for Fq arises if the change of summation 
and integration in (9) can be justified. Then the integrand is a Fourier series with simpler 
coefficients, and the higher partial sums of this series exhibited a very smooth appearance 
in all the plotted examples. The proof of this final representation for Fq is simple if the 
parent densities belong to the class 

BVC[a,b]:={f:x£[a,b]^R, f E C[a,b], f e C\a,b)}. (12) 



Thus, / e BVC[0, 1] means / is continuous and consequently bounded on [0, 1] and of 
bounded variation. Densities from this class will be treated in section 2. If the density 
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/ of X is a polynomial or a trigonometrical polynomial then ip(t, z) is representable by 
simple finite terms, containing only the error function crf(z), |3Je(z)| > |3m(z)|, the 
exponential function and powers. For more general densities some different series repre- 
sentations of tp will be given. Section 2 also contains a generalization to the distribution 
of a one-factorial positive semi-definite quadratic form Q of bounded independent, but 
not necessarily identically distributed, continuous random variables. To include also 
unbounded densities, e.g. the whole family of beta densities, a broader class will be 
investigated in section 3. 

Formulas from the handbook of mathematical functions by M. Abramowitz and 

1. Stegun are cited by (A.S.) and their number. The symbol £ stands for "expectation". 

2. The distribution of Q for bounded parent densities with an absolutely 
integrable derivative 

Theorem 1. Let X be a random variable with a density f G BVC[0, 1] as defined 
in (12). The cdf Fq of Q — Y^i=i(Xi — X) 2 from a corresponding random sample is 
given for all n > 2 by 

/oo oo 
Y,%m(y/=itk~(ip*(t k ,u)) n ) (l-cos(t k x))/k du (13) 
-°°fe=i 

with the cfip*(t, u) of (X + u) 2 and t k = kn/q, q = supQ from (2), or by 

F Q {x) = 2ir- 3/2 Vn~ / V 3m (^itk~ (ip*(t k , -u)) n ) (1 - cos(t k x))/k du 
"'o fc-l 

(14) 

+2tt- 3 / 2 V^ / ^2%m((iP(t k ,y)) n + (ip(t k ,-y- (1 - cos(t fc a;))/fc dy 

Jo k=i 

with ip(t, y) = £ cxp ^— (y + V '—it X) 2 ^j . 

The density of Q is square integrable for all n > 3 and it is obtained by the derivative 
under the integral at least for all n > 4. 

Remarks : For numerical evaluations the upper limit of integration y = oo in the 
second integral in (14) can be lowered to small values yo, depending on n and /, because 
of the very rapid decrease of the integrand. 

If /(1/2 + a;) = f(l/2 — x) then the integral in (13) can be replaced by 2 J_^ 2 ■ • • du, 

1 /2 

the first integral in (14) by 2 f ' ... du, and it is 

(ip (t k ,-y - V-it k )) n dy = / (ip (t k ,y)) n dy. 



Before the proof we look at the order of magnitude of Fq(x), 1 — Fq (q — x) , x — > 0, 
if / is only supposed to be bounded. In particular, for the uniform distribution on [0, 1] 
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Fq{x) is asymptotically given by the volume of a thin cylinder whose axis goes through 
(0,...,0) and (1,...,1). Thus 

F Q (x)~ V^^n-l^ 1)/2 , X^O, 

with the volume 6 n _i of the (n — l)-unit ball. There are exactly (1 + n — 2[n/2]) 

corners of the n-unit cube with maximal distance ^Jq from this axis. If x approaches 
q then 1 — Fq (x) is asymptotically given by the volumes of small simplices with these 
corners at their tops, which leads to 

n \ 1 / n N 



[n/2] 



1 - F q{q - X ) * (1 + n - 2[n/2]) ^ ^ x«, ^ 0. 

Therefore, we obtain for any bounded parent density / on [0, 1] 



(x) = O (V"" 1 )/ 2 ) ,l-F Q (q-x) = 0(x n ), x^O, (15) 



with O-constants depending on / and n. 

With some additional regularity conditions for / we can expect for the corresponding 
density f Q 

f Q (x) = O (x("- 3 )/ 2 ) , f Q (q -x) = (x™- 1 ) , x - 0, (16) 
and therefore Jq(x) — > 0, x — > 0, if n > 4. Then a Fourier-sine series for Jq is suitable. 

The proof of theorem 1 is accomplished essentially by the following lemma: 

Lemma 1. Let f be a probability density within BVC[0, 1]. Then the following 
bounds for i[)(t, y), ip*{t, u), t > 0, y > 0, are obtained: 

m,y)\,W,-y-^\ < Ct-^mm^y-'e-y 2 ), (17) 
\tp*(t,u)\ < d r 1 ' 2 , ue R, (18) 
\ip*(t,u)\ < C 2 t~ 1 \u\- 1 ,\u\>2, (19) 

with suitable constants C, C\, C2 depending only on f. 

Furthermore, with t = v 2 tk, tk from (6), v > 1, 

/ |"0 (tk,± (y + vy/-it k ))\ n dy = 0{v~ n ), v -> 00, 
Jo 



(20) 



\r(tk,u)\ n du - 0(t- {n - 1)/2 ) , (21) 

-00 

and 

fQ(t k )\=0(t-^- 1)/2 ) (22) 
with O-constants depending only on f and n. 
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Proof. By partial integration we find 

i>(t,y) = \M-it)~ 1/2 (erf (y + yfTi) - erf (y)) /(l) 
1 f 1 

-^Vi^Ji J (erf (y + xV^l) -crf(y)) f'(x)dx, 

and therefore 

IV(*,y)l <Ci t- 1/2 , 

since erf (2) is absolutely bounded on {z\ \dte(z)\ > \$$m(z)\}. 

With |exp(- (y + xv^it) 2 ) \ = | exp ( - (y + x^/t/2 - ix^ft/2) 2 ) \ 
— exp ( — y 2 — xy^/2t) we get 

m,y)\<C 2 t-^y-'eM-V 2 )- 



(23) 



(24) 



Together with (23) this implies (17) for ip(t,y) with C = max(Ci,C2). 

Since ip(t, — y — f(x)) = ij){t,y;f{l — x)) the bound in (17) also applies to 

i>(t,-y- ^fTt). 

The bound in (18) is found by the same way as in (23) with y replaced by uy/—it. 
If \u\ > 2 we have \u + x\ > \u\/2 and using (A.S. 7.3.22) we obtain 

erf (y=E(u + x)j - erf (^/^i^ uj < C 3 * — 1/2 — 1 , 

and consequently the bound in (19). 



For the proof of (20) we use 

J exp ^— (z + y/—it + xy/—itkj 
< J exp (-z 2 - zV2t- xz\/2t^) dz < J exp (-zV2t) dz = (2t)~ 1 
and consequently 

erf (y + y/^E + x^-it k ^J - erf (y + V^itj 

= erf (y^it + x^f^ihj - erf (V^iifj I + Oir 1 ' 2 ) = 0(t~ 1/2 ), 



dz 



/2 



to obtain again by partial integration 

\i>{t k ,y + vs/=ih)\ =0{t- l ' 2 ) = 0{v- 1 ), v^oo, 
with an O-constant depending on / and not on k. Besides 



(25) 



ip (t k ,y + V^ii^j < J exp ^- (y + \f^it + x^-it k ^J 
< d exp (-y 2 - yV2t) (yV^)" 1 ■ 



f(x)dx 



(26) 
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Now the integral J °° \ip\ n dy is splitted over the intervals < y < (2t k ) and 
(2t k )^ 1 / 2 < y. Following from (25) and (26) the contribution of these integrals are 0{v~ n ) 
and 



O (V /2 exp(-nvy)y- n dy > j = O (e~ nv ) , v -» oo, 



which provides the assertion (20) for the upper sign. The proof for the lower sign is very 
similar. 

The relation (21) follows immediately from (18) and (19). Finally, 

\f Q (t k )\ < vw^ [ m k ,z)\ n \dz\ 

with "fk from (7), and therefore the assertion (22) follows from (17) and (18). □ 

Proof of theorem 1 : The Fourier cosine coefficients of Fq are given by 
—2/(qt k )^sm(fQ(t k )) and with (22) from lemma 1 we obtain for all n > 2 the abso- 
lutely and uniformly convergent series for Fq, already given in (9), (10). The square 
integrability of /q for all n > 3 is a consequence of (22). The change of summation and 
integration in (9) is justified by the majorants provided by (17) and (18) which proves 
(14) and gives the density /q by the derivative under the integral at least for all n > 4. 
To prove (13) the paths ^ k are modified to 

Jkv ■■= {y - vw k | y < 0} U {uwk | - v < u < v} 

U {y + vw k | y > 0}, w k = V^itk, v > 1. (27) 

Then 



f (TP(t k ,z)) n dz = w k f (i;(t k ,uw k )) n du + 0(v- n ) 



because of (20). Therefore, equation (11) follows for v — > oo. Now the representation 
(13) follows from the majorants obtained by (18) and (19). The density /q is obtained 
again by the derivative under the integral at least for all n > 4. □ 

The representation in (5) for the cf /q is a consequence of the " one-factor" structure 
of Q. Therefore, theorem 1 can be generalized to one-factorial quadratic forms. 

Theorem 2. Let Q = X'(D =p cc')X be a one-factorial positive semi-definite 
quadratic form with n bounded independent — but not necessarily identically dis- 
tributed — random variables Xj with densities fj G BVC[aj,bj], aj < 0, where 
D = Diagonal(c?i , . . . , d n ) > and c = (ci,...,c„)' e M™ with at least two com- 
ponents Cj 7^ 0. With the cf ip*(t,u) of dj(Xj ± Cju/dj) 2 , u e R, j = l,...,n, 
A = 1 =F Sj=i c |/^i — and the numbers t k — kir/supQ, k e N, the cdf Fq of Q 
is given by 
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F Q (x) = (28) 

exp(±Ut fe w 2 ) Vritk ~[[ip*{t k ,u)\ ■ (1 - cos(t k x)) /k du 
-°°fe=i ^ j=i ' 

and i/ie density /q is square integrable for all n> 3. 

Remarks : The condition otj < was only introduced to get [0, sup Q] as the range 
of Q. With different intervals [ctj, bj] and indefinite forms Q general Fourier series for /q 
and Fq can be derived in the same way. 

The corresponding generalization of (14) can also be derived if the paths 7^ are 
replaced by the j kv in (27) with a sufficiently large number v, depending on the An 
numbers aj , bj , Cj , dj . 

Proof of theorem 2 : The upper signs below refer to the negative sign in D =p cc'. 
From theorem 1 in Roycn (2007b) the cf /q is given by 

/oo n 
exp(-Xy 2 )Y[^(t,y)dy (29) 
3=1 

with ipj(t,y) = £cxp (^—dj (\/—it Xj + Cjy/dj) 2 ^ , t, y € R. Turning the way of inte- 
gration in (29) by =F 7r /4 we get 

/oo n 
exp(± Uiu 2 ) Y[ 1>j (t, u)du. (30) 
■00 j=1 

With the numbers t k we obtain by the same method as used for (18), (19) the estimates 

|^(i fc ,u)| = 0{t- k 1/2 ), ueR, 

and (31) 
\i>*(t k ,u)\ = O^Vl" 1 ), cj + 0, \u\ > uo, 

with O-constants depending on / and a 3 , 6j , Cj , dj , and a sufficiently large number it 
depending on the 4n numbers a, , bj , c 3 , dj . Therefore 



1/^)1 =o(V"- 1)/2 ), 



and consequently Jq G £ 2 (0, supQ) for n > 3. 

The Fourier cosine series for Fq from (9) with fQ(t k ) obtained from (30) is again 
absolutely and uniformly convergent for all n > 2 and the assertion in (28) follows by 
the change of summation and integration in this series, justified by the majorant derived 
from (31). □ 



This section is concluded by some formulas for ip(t, z) = £ exp 
If X has a polynomial density / of degree d we get 



1>(t,z) = \V-™/t cxp(-z 2 )/(-iv^7t— ) (cxp(z 2 )(crf(z + V^)-erf(z))) (32) 



or equivalcntly 

pz+^—it 

ip(t, z) = yfijt / exp(— w 2 )f((w — z)/\/—it)dw 



= E f (k) (-*/>/=**) ^ (mi, -w 2 ) (w/v^i) k+1 /(k + ly. 

k=0 



z+V—it 



(33) 



In particular, 



p{t, u) = £ / (fe) (-«)i^i ^ 2 ) « fc+1 /( fc + !) ! 



fc=0 



(34) 



Writing z fc as a linear combination of Hermite polynomials Hj the functions 

/ e- z \ k dz = !Fi (^±i, ^, -™ 2 ) W fc+1 /(fc + 1) 
Jo 

are representablc by linear combinations of erf(u>) and Hj(w) exp(— w 2 ), j = 0, . . . , k— 1. 

A general formula for any bounded or unbounded density / with the moments Hj 
is given by 



W,z) = exp(-z 2 )^^^(z)(-V^)V. ? !. 

3=0 



(35) 



There is an alternative if the cf / = f c + if s of / is available. With the "base functions" 



ip2m(t, z) :— tp(t, z; exp(2rmrix)) 

= ^y/in/t cxp (— im 2 TT 2 /t) exp {^.mirzyj— i/tj erf (^z + xy/—it + rrnry/—i/tj 
t > , 



(36) 
i 

x=o' 



in particular with z = uy '—it 



(37) 

= ^\A^A CX P (—im 2 ir 2 /t) cxp {—2mmu) erf (V— it (u + a;) + miry/— i/tj 



x=0 
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and 

i/>c,m(t,z) := ip(t,z;cos(mnx)) = 
ip s ,m(t, z) := ip(t,z;s'm(mirx)) 

± """A exp (-im 2 7r 2 /(4t)) • 



^exp (—m-KZ^J '—i/tj erf + x\J —it — ^rmnj—i/tj ± 
cxp [m-KZyJ '—i/tj erf + x\J —it + \^rm\\J — i/tj j 



1 

where the upper signs refer to Vc,m, an d with the Fourier coefficients /(— 2m7r) of / it 
follows 

V»(t, z; /) - /(-2m7r)V 2m (i, «), (39) 

if / G £ 2 (0, 1). Similar series are obtained from a Fourier sine or Fourier cosine series 
for /. 

For unbounded /, and in particular for / ^ £ 2 (0, 1), we can use the following relation 
obtained by partial integration. 

tp(t, z; /) = cxp(-z 2 ) - 2zV^ii ip(t, z; 1 - F) + 2itip{t, z; (1 - F)id), (40) 

with id{x) = x and the cdf F of X. 

The Fourier cosine series for 1 — F{x) and the Fourier sine series for x(l — F(x)) 
have the coefficients 

^A Q = Ml = £(X), A m = 2f s (mir)/(mir) (41) 

and 

b m = 2 (/ s (m7r)/(m7r) - £(tott)) / (mw), m e N , (42) 
respectively, where denotes the derivative of / s = 3m(/). Therefore 

V>(t,z;/) = cxp(-z 2 )- (43) 

(OO \ OO 

MiV'o(^) + A m^c,m{t, z) j + 2ii ^ b m ip a>m (t, z). 
m—l / m—1 

Similar formulas are obtained from Fourier series for 1 — F(x) and x(l — F(x)) with 
the ONS {exp(2mirix)\m <G Z} on [0,1]. E.g. the cf of any beta density f(x) = 
a; p- 1 (l - x) q ^ 1 /B(p,q) is given by f(t) = iFi (p,p + q, it) with the derivative f'(t) — 
ip/(p + q) \F\(p + l,p + q + 1, it). Possibly, repetition of the step leading from (39) to 
(43) might provide even more rapidly converging series for ip. Unevitably, values of ip 
with large arguments are needed but there are at least some programs available which 
provide rather accurate values of some well known transcendental functions for large 
arguments. 



3. The distribution of Q for unbounded densities 
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A first idea of the order of magnitude of Fq(x), x — > 0, is found by the following 
inequalities. With the range R of the random sample X\, . . . , X n we get 

i R 2 < Q < i nR 2 , n > 2, 

and consequently 

P{R < 2{x/n) 1 ' 2 } < P{Q <x}< P{R < (2x) x ! 2 } . (44) 
At first it is useful to investigate the special density 

f p {x) =px p -\ < x < 1, <p< 1 . (45) 

Lemma 2. 

, < iw, , , { ' , (46) 

for Q belonging to the parent density f p from (45). 

Proof. P{R < e} = np J* (min(l, (a; + e) p ) - a;P) n_1 x^dx = np(h + I 2 + h), 
where the Ij denote the subsequent integrals on (0, e], (e, 1 — e), [1 — e, 1]. The last integral 
is 0(e n ) and can be neglected. 

h = e np f ((1 + t) p - fff- 1 t p - x dt = Ce np . 
Jo 

h = p n - l e np I* with 

/•l/e-l A/e-1 

J (1 + t^P-^-^t^dt <I?< J &-^ n dt. 



Thus 



1% < ((1 - P)n - I) -1 , n(l - p) > 1 ^=> np < n - 1 , 

^2 < m (V £ ) 7 = n — 1 , 

4* < ((P - 1)« + , np>n-l , 



and therefore 

j _ ( O ( £ ™«(«P.«-i)) j n p ^ n - 1 1 
2 ~~ I 0(e"- 1 ln(l/e) , np = n - 1 J " 

Now the assertion follows from (44) with e = (2.x) 1 / 2 . □ 
A more accurate asymptotic relation is also available: 

Theorem 3. The cdf Fq of Q belonging to the density f p (x) = px v ~ x , < p < 1, 
satisfies the following asymptotic relations: 

Fq(x) ~ C pn x s , x — > 0, 5 = min(np, n — l)/2, np 7^ n - 1 , (47) 
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F Q (x) ~ C pn x^l 2 Hl/x), np = n-l. 

With 

h±(y):=p exp(-(x±y) 2 ) X P- 1 dx 
Jo 

= 2-P/ 2 r(p+l)cxp(-y 2 /2)t/(p-l/2,±yV2) , 
where U denotes a parabolic cylinder function, the constants C pn are determined by 



{{h-{y)r + {h+{y)) n )dy, np < n - 1 
T(l + 5)y/n/n C pn = <j ty^n/fa _ !)„ + !) f np > n - I 

Xp^) n /2, np = n-\ 



Proof. The LT ]* Q of Q is 

/OO 
(A(t,»)) n dj/ 
-oo 



with 

\(t, ±y) = p J exp (-(±y + xVif^j x^dx 

= t~ p / 2 p / cxp (-(x ± y) 2 ) a^da; 
Jo 

< ^ p/2 ^(y) • 

With < y < (1 - e)\/i, < s < 1, we get for large t 

(■OO 1 

/ exp(-(a; - yf^x^dx ~ - V^F t< p " 1 )/ 2 erfc(Vi - y) 
M 2 

< i t^ 1 )/ 2 t- 1 / 2 e- 1 cxp(-e 2 t), 

and therefore 

A(t, -y) ~ t- p/2 ^(y), i ^ oo, < y < (1 - e)V~t . 
From (A.S. 19.12.3) and (A.S. 13.5.1) or from (A.S. 19.10.3) 
lim y 1 ~ p h~(y) = pVtt 

can be derived. Thus, 

/>OC 

/ (hp (y)) n dy < oo if < n — 1 . 
Jo 

Obviously 

\(t,y)~t-r/ 2 h+(y), t^oo , 



and 
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/ {h+{y)) n dy < oo . 
Jo 

Now we obtain from (52), (54), (55) 

f* Q {t) ~ V^A {{h-{y)T + (h+(y)) n ) dy^j r^ 2 , t ^ oo, (56) 

if np < n — 1 . 

To treat the case n > np > n— 1 the parameter e in (53) is replaced by e(t) — i -1 / 4 , 
but nevertheless it will be denoted by e for simplicity. 

POO 

1= / (X(t,-y)) n dy 
Jo 

is splitted into I\ + h + h + h over the intervals (0,(1 — s)Vi) 7 [(1 — e)^/t,^/t], 
{Vt,(l+e)Vt}, ((l + e)Vt,oo). Then 

r (l-e)Vi 

h t~ npl2 \ (h~ (y)) n dy ~ (pV^) n ((p - l)n + l)" 1 i"*"" 1 )/ 2 . (57) 

Jo 



A (t, -(1 - <*)>/*) = P y exp (-i(a; - (1 - S)) 2 ) x^ 1 dx 

= P(l - ^) P ^ cxp (-r(£ - l) 2 ) ^ d£ , 
Jo 

r = (1 - (5) 2 t, £ = (1 - S)- 1 ^ < (5 < e = t~ 1/4 , 

is investigated for t — ► 00. With the abelian part of the Tauber theorem for Laplace 
transforms of measures on (0, 00), see e.g. Feller (1971), it follows 



P (l-Sf /' 1 ex P (-T£ 2 )(l-0 p - 1 
Jo 

~ \ p£ exp(-rC)(l - VCr-'C^dC *\p^ t~ X 



d£ 

/2 



and 

rt 1 -^ 1 1 / n 

p(i - 6) p y exp (-r(e - 1) 2 ) e^ 1 ^-^ p ^ r crf ( 5v ^J • 

Thus, 

A (t, -(1 - <5)Vt) ~ 1 ^ ^ 1/2 (1 + erf(Jv^)) 

and 

J 2 ~ O (et-f"- 1 )/ 2 ) = o (t-f"- 1 )/ 2 ) . (58) 
Furthermore 



h < h 
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since \(t, -(1 + 5)y/t) < X(t, -(1 - S)Vt). With y = (1 + we get 
J 4 = t 1/2 ^ (p jf exp (-i(l - ar + u) 2 ) z p_1 efa;^ 



o t 



.1/2 



r-(-^*)-.(-(-^ 



Together with (57) and (58) it follows 
(\(t,-y)) n dy~h ■ 



/•DC 

L ( 



/o 

Then with 

POO POO 

/ (A(t, < i""*/ 2 / {h+(y)) n dy = o(/i) 

Jo Jo 

we obtain 

/5(t) ~ (pV?F) n ((p - l)n + I)" 1 t-f"- 1 )/ 2 , np > n - 1 . (59) 

Now from (56) and (59) the assertion (47) follows from the Tauber theorem mentioned 
above. The remaining case np = n — 1 is proved in the same way as (59). □ 

Theorem 3 suggests /q e £ 2 (0, q) if 5 > 1/2, which is proved by lemma 3 below. In 
view of this theorem a " too rapid" increase of the density / near to its end-points might 
cause some problems. E.g. let be h(y) = y~ x (ln(ey)) -2 , y > 1, and f(x) = h~ 1 (x) — 1. 
Then / is a monotone decreasing probability density on (0, 1] with 

lim x^/fix) = lim + y)) £_1 = , 

x— >0 y—>oo 

< e < 1, and / £ Z7(0, 1) for any p > 1 . 

Such cases and also "too strongly" oscillating densities are excluded for unbounded 
densities within the following class of functions. 

Definition : Let BVC*\p\, . . . ,p r , qi, . . . q s }(0, 1), < pi,...,q s < 1, denote 
the class of real unbounded functions / on (0, 1) for which there exists a function 
fo € BVC[0, 1], as defined in (12), and any numbers a\, . . . , a r , b\, . . . , b s with 

r s 

f(x) - ]T a^- 1 - bj(l - x)^- 1 = h(x) . (60) 
»=i j=i 

All finite linear combinations of beta densities belong to such a 73V r C*-class since 
every B(p, g)-density with < p, q < 1 differs from a suitable linear combination of x v ~ x 
and (1 - x)*- 1 only by a function from BVC[0, 1]. 

Theorem 4. Let X\, . . . ,X n be i.i.d. random variables with a density f € 
BVC*[pi, . . . ,p r , qi, . . . , q s ](0, 1)- Then the density /q of Q — Ym=\{Xi — X) 2 is square 
integrable for all n with 25 = min(np, n— 1) > 1, where p = min(pi, . . . ,p r , qi, . . . , q s ) and 
the cdf of Q is given again by equation (14) in theorem 1 at least for all n with 5 > 1/4. 
An integral representation for /q is obtained by the derivative under the integral in (14) 
for all n with S > 1 . 



15 



Remarks : A representation of Fq only by ip*(t,u) as in (13) is not given here 
because it would be numerically less useful. However, it can be shown that e.g. the 
condition ramin(p, 1 — q) > 1 with q = max(pi, . . . ,p r , qi, . . . , q s ) is sufficient for such a 
representation. 

For the proof of theorem 4 we need some majorants for ip(t, z), z <E jt, t > 0, for the 
special densities f p (x) = px p ~ x and f p i(x) = f p (l — x), < p < 1, and the resulting 
bounds for |/q(£)|. According to the three parts of the path j t , defined as in (7) with t 
instead of tk, we define 

i>+{t,y) = £exp(-(y + V^lA) 2 ^ 



4>-(t,y) = £exp(-(y + V^(l- A)) 2 ) , y > 0, 
%{t,u) = V+ (t,W=it) , -1 < u < , 



(61) 



with expectation referring to f p . 

The invariance of Q under the transformation A — > 1 — A is reflected by the easily 
verified relations 

<l>pi(t,y) = i>p{t,y), i>pi{t,y) = 4>£{t,y), 
r p i(t,-u) = -(!-«)), o< u <i . 

Hence, we have only to investigate the functions defined in (61). 



(62) 



Lemma 3. For the functions i>p,ipp, defined in (61), the following estimates hold 
for large t: 



\^(t, y)\ = min(l, y~ p cxp(-y 2 ))0(t- p / 2 ), 



\r P {t,-u)\ ={ 

(0 < u < 1) 



' o(t-p/ 2 ) 

Oit^^uP- 1 ), u > c/Vt, P > 1/2 
_ 0(t- p u- p ), u > c/Vt, p < 1/2 



(63) 

(64a) 
(646) 
(64c) 



with a suitable constant c > l,t > c 2 and O -constants depending only on p. 
Furthermore, for large t the cf fq(t) of Q is absolutely bounded by 



- [ m,z)\ n \dz\ = { 



' 0(t s ), 5 — min(np, n — l)/2, np ^ n — 1, np > 1 ' 
0(t-( n -V/ 2 ln(t)), np = n-l 

0(t-("f- 1 /2) ) l/2<np<l 

, O^" 1 / 2 ln(i)) np = 1 J 



^ (65) 



with O-constants depending only on p andn. 



Remark : Obviously, the exponent within the O-term for np < 1 does not give the 
correct order of magnitude for /q. This is a consequence of the coarse estimation of 
I Jo (^^dul by J Q \ip*\ n du given below. On the other hand these estimates are at least 
sufficient to ensure the square integrability of /q for np > 1 and this is considered as a 
minimal requisite for a reasonable rate of convergence in the representation (14) for Fq. 



16 



PROOF : With t > 0, y > and 

|exp(-(y + x^ii) 2 )\ = | cxp(-(y + xy/T/2 - ixy/tj2) 2 )\ 

= exp(-y 2 - xyV2t) (66) 

we get 

\^{t,y)\ < eM~y 2 )pj(p,yV2t)(yV2iy p 

< 2- p / 2 T(p + l)t- p ^y' p exp(-y 2 ). (67) 
To obtain an estimate for small y and t > 1 the integral i[)£(t, y) is decomposed into two 
integrals I\, I 2 over the intervals (0, i -1 / 2 ], (t _1 / 2 ,l] respectively. Obviously we have 

For 1 2 we find after partial integration 

h = ^ pVtF V^A ( crf (y + xv^itj - erf (y)) x p_1 |* =t _ 1/2 
1 Z" 1 

— 2 pV^y/i/t J ^ (crf (y + xV—itj — erf(y)^ (p — l)x p ~ 2 dx . 

With the number max{erf(z) | |3?e(z)| > |3yn(z)|} and suitable constants d,i — 1,2,..., 
only depending on p, it follows 

\h\ <Ci t- 1/2 + c 2 r^ 2 , 

and therefore 

l^ + (t,y)l <c 3 t- p / 2 , t>\. 

Together with (67) this yields (63) for ip+ . 

To find a majorant for tp~ we proceed as before: 

W>p(t,y)\<p[ cxp(-y 2 -yVtt + xyV^xP- 1 
Jo 

= cxp ( - y 2 - yV^tFi (p,p+l, yV2t) . 
With (A.S. 13.5.1) and y\fi > c, where c is a suitable constant, it follows 

\%{t,y)\ <Ci exp(V)(yV2T) 9 (V(p+l)cxp(-yV2t) +p(yV2iy * 

< C 2 y' p exp(-y 2 )t-P/ 2 . (68) 
On the other hand we obtain after partial integration 

ipp (t, y) = ~\ P\/^Ji (erf (y) - erf (y + V^ity ) 

~\ P ^J (~^ / ~^) 1 ( crf (y + v /r it(l-a;)) -erf (y + v 73 ^)) (p - l)x p - 2 dx. 



dx 



The last integral is splitted again into I\ + 1 2 over x < t x l 2 and x > t 1//2 , t > 1, 



17 



and we find 

rl/Vt , 

dx 



n/vt 

/i| < J |(-xV^i) _1 [cr{(y + V=ii(l - x)) - ctf(y + y/=it)j | (1 - p)x p_1 

< C 3 t- p/2 , 

h\ < C 4 r 1 ' 2 f (l-p)x p - 2 dx < C\ t- p/2 , 
Ji/Vt 



l/Vt 
and therefore 

\%{t,y)\<C 5 t>\. 

Together with (68) this implies (63). By the same way (partial integration and splitting 
the integral) the assertion (64a) is obtained. 

Now a bound is given for ip* containing u. After the substitution x — > x / \fi we get 



-p 
with 



C(t, -u) = pcMitu^t-P^ih - I 2 ) (69) 



/>oo 

Ii = I cxp(—2ixu\ // t)exp(ix 2 )x p ^ 1 dx 
Jo 



and I 2 with the same integrand over oo). The first integral is evaluated 
by a combination of the parabolic cylinder functions U{— p — 1/2, ±(1 + i)uy/i), 
U(-p - 3/2, ±(1 + i)uVt). By means of (A.S. 19.12.1), (A.S. 19.12.3) and (A.S. 13.5.1) 
(or only empirically by plotting) it can be shown that 

|/i (-2uVt) | < Ci (usfty min (P'^P) , uVt > c , (70) 

with a suitable constant c > 1. By partial integration of exp(itu 2 )I 2 we obtain 

\I 2 \ < J ^ erf (V^i(x - uVT)) - erf (-/^(l - u)) (1 - p)^" 2 dx 

and together with (69) and (70) it follows 

\r P (t,-u)\ < c 2 t-w + a t-^ 2 {uVt)- mhi ^ 1 - p \ 

which implies (64b) and (64c). 

Finally, the cf /q is absolutely bounded by 

V^A ( V~t£ \r p (t, ~u)\ n du + ^(^(t, V)\ n + HZ(t, 2/)|") dy. 

Due to (63) and (64a) the second integral and y/i f° /Vl \%\ n du are 0(t~ pn / 2 )-tcrms. 
If p > 1/2 then 



sft f 1 du = 0(t- [n -^l 2 C u [p -^ n du) 

Jc/Vt V Jc/Vi > 

•n - 1 

(71) 



cl-Ji ^ Jcl-Jt 

' ^-min(np,n-l)/2) ^ n p ^ n - 1 

O (t-(«-i)/2 ln(t)) , rcp=n-l . 
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If p < 1/2 then pn < n — 1 and 

Vt f 1 icr du = o(t i/2 - np f 1 u - np du) 

Jc/Vi V Jc/Vt ' 

2 ) nn 1 

(72) 



c/Vi ^ Jc/Vi 

jo (i""P/ 2 ) + 0(i-("P-V2) , ^ 1 



\0(t-^ 2 ln(t)) , rcp = 1 . 

If > 1 then mm{np/2, np - 1/2) = np/2 and (65) follows from (71) and (72). □ 

Additionally, for the proof of theorem 4 the following lemma is applied. 
Lemma 4. Let be f { e C 1 {0,1), i = 1,2, 

4>i(t, z) = J exp ^ — (z + x\J— it) 2 ^j ,fi(x) dx, t e R, z e C , 
{7fc|fc € N} the family of paths defined in (7), 

Mtk)~ f \M*k,z)\ n \dz\ and 4>{t k ):= f (|ci^i| + |c2^|) n H 
with any numbers C\,C2- Then the assumption ^2 k (f>j{tk) < oo, i = 1,2, entails 

Proof : It has only to be shown that tf^Jytk) < oo for the numbers 

<l>mn{tk)= I \i>i\ m \H n - m \dz\, m=l,...,n-l . 
From Holder's inequality it follows 

Ct>mn(tk) < (Mtk)) m/n (Mtk)) {n - m)/n , 

and again with Holder's inequality for sequences we get 

(\ m/n / \ (n — 7n)fn 

E^fe) <oo. □ 

k / \ k J 

Proof of theorem 4 : 

For a probability density / € BVC*[p\, . . . ,p r , qi, . . . , q s ](0, 1) the corresponding func- 



tion ij)(t, z) = £ exp (z + xyj— it) 2 ^j i 



r s r+s 

i=l j-1 i=Q 

with belonging to /o € £?yC[0, 1]. 



is a linear combination 
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The pairs (Y^iLo c m+i^m+i), m = 0, . . . ,r + s — 1 satisfy the assumptions of 
lemma 4 if 2(5 = min(np, n — 1) > 1 with p = min(pi, . . . ,p r , qi, . . . , q s ), and we obtain 

^4> 2 {t k ) < oo 

k 

for the numbers 

Htk) = I m k ,z)\ n \dz\ 

and a fortiori 

2 



k 



< CO, 



which proves the square integrability of /q if 5 > 1/2. 
Furthermore, because of (65) in lemma 3, the series 

oo 

(2/tt) E S ™ (/«(**)) (! - cos(t^))/fc 
fe=i 

is absolutely and uniformly convergent at least for all n with S > 1/4. The change of 
summation and integration is justified by a majorant for the integrand derived from 
(63) and (64) in lemma 3 and repeated use of Holder's inequality. For the same rea- 
son the density Jq is obtained by the derivative under the integral for all n with S > 1. □ 



The last theorem provides only a coarse estimation for the order of magnitude of 
Fq(x) for small x under rather general assumptions for the parent density /. 

Theorem 5. Let f be any probability density on (0, 1), 

r := sup{p|/ e £ p (0, 1)} < oo and n{\ - l/r) > 1 . 

Then 

F Q (x)=o(x s ), x^O, 
at least for all 6 < S r := (n(l - l/r) - l)/2. 



PROOF : If n(l — l/r) > 1 then there exists a p < r with n(l — 1/p) > 1. Then with 
9 = p/(p — 1) an d Holder's inequality we obtain for 

X(t,y) = £ cM-{y + xVi) 2 ) 

the bound 

X(t,y) = \\f\\ p Q cxp(-q(y + xVi) 2 )dx 
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It follows for the LT /£ of Q 

/OO 
(A(t,y)) n 

< 11/11? (^f) ((crf(v^(?y + Vl)) -crf(v?y))/V^)" /9 dy 

= 0(t-*'). * -» oo, S p = (n/q - l)/2 = (n(l - - l)/2 

with an O-constant depending on p, n and /. Therefore 

F Q [x) = 0[x s '), x^O, 
and consequently 

Fq(x) = o(x 5p ), x — > 0, for all p with £> p < <5 r . □ 

Unfortunately, without any further assumptions we cannot conclude from theorem 
5 to jq E C 2 for n(l — 1/r) > 2. Good estimates for |/q(£)| are not easily found from 
the only assumption / G £ p (0, 1), p < r, and it seems to be difficult to determine the 
lowest sample size n for which e.g. Jq € £ 2 . This was the reason for the introduction of 
the classes BVC*[pi, . . . ,p r , qi, . . . , q s }(0, 1). 

As an example we look again at the density f a (x) = ai a_1 , < a < 1, belonging 
to £f (0, 1) for all p < 1/(1 - a). For 1< an < n - 1 

F Q (a;) = 0{x an ' 2 ), x^O, 

follows from (46) or theorem 3. Theorem 5 yields only 

F Q (x)=o(x 6 ), 6< (cm -l)/2. 

This discrepany is caused by the weak assumptions for / in theorem 5 and by the coarse 
estimation with Holder's inequality. 
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